R2E: Rule-based Event Extractor
نویسندگان
چکیده
In this paper we present a rule-based method of event extraction from the natural language. We use the Stanford dependency parser in order to build a relation graph of elements from input text. This structure along with serialized extraction frames is converted into a set of facts. We describe a process of creation of application of rules, which aims to match elements from the text with corresponding slots in the extraction frames. A possible match is derived by the comparison of verbal phrases from the text with lexicalizations of anchors (constituting the most important part of each frame) stored in an ontology. The rest of the extraction frame is filled with other elements of the dependency graph, with regard to their semantic type (determined by lexicalizations of allowed types defined in frames and ontology) and their grammatical properties. We describe conversions required to create a consistent knowledge base of text phrases, classification of semantic types and instantiated slots from the extraction frames. We use the Drools engine in order to extract events from such a knowledge base.
منابع مشابه
IsoQuest Inc.: Description Of The NetOwl (TM) Extractor System As Used For MUC-7
IsoQuest used its commercial software product, NetOwl Extractor, for the MUC-7 Named Entity task. The product consists of a high-speed C engine that analyzes text based on a configuration file containing a pattern rule base and lexicon. IsoQuest used the NameTag Configuration to recognize proper names and other key phrases in text, and mapped the product’s extraction tags to the MUC-7 NE tags. ...
متن کاملGenetic Programming Fuzzy Rule Extractor Using Class Preserving Representation
This paper describes a genetic programming approach to the construction of fuzzy classification system with if-then fuzzy rules. Recently many research studies were focusing on utilisation of evolutionary techniques for automatically extracting fuzzy rules from data. In this paper we present a method based on genetic programming with a special structure preserving representation and special rul...
متن کاملA General-Purpose Rule Extractor for SCFG-Based Machine Translation
We present a rule extractor for SCFG-based MT that generalizes many of the contraints present in existing SCFG extraction algorithms. Our method’s increased rule coverage comes from allowing multiple alignments, virtual nodes, and multiple tree decompositions in the extraction process. At decoding time, we improve automatic metric scores by significantly increasing the number of phrase pairs th...
متن کاملA case study on automated risk assessment of ships using newspaper-based event extraction
In this paper we describe an event-type extractor on top of a distributed search engine. We apply this event-type extractor in a case study concerned with assisting maritime security operators to assess potential risk factors of ships. Based on a corpus of maritime-related press releases we automatically investigate the history of ships as they enter an area of interest. The performance of the ...
متن کاملA Rule Extractor for Diagnosing the Type 2 Diabetes Using a Self-organizing Genetic Algorithm
Introduction: Constructing medical decision support models to automatically extract knowledge from data helps physicians in early diagnosis of disease. Interpretability of the inferential rules of these models is a key indicator in determining their performance in order to understand how they make decisions, and increase the reliability of their output. Methods: In this study, an automated hyb...
متن کامل